Interactive Hesitation Synthesis: Modelling and Evaluation

نویسندگان

  • Simon Betz
  • Birte Carlmeyer
  • Petra Wagner
  • Britta Wrede
چکیده

Conversational spoken dialogue systems that interact with the user rather than merely reading the text can be equipped with hesitations to manage dialogue flow and user attention. Based on a series of empirical studies, we elaborated a hesitation synthesis strategy for dialogue systems, which inserts hesitations of a scalable extent wherever needed in the ongoing utterance. Previously, evaluations of hesitation systems have shown that synthesis quality is affected negatively by hesitations, but that they result in improvements of interaction quality. We argue that due to its conversational nature, hesitation synthesis needs interactive evaluation rather than traditional mean opinion score (MOS)-based questionnaires. To validate this claim, we dually evaluate our system’s speech synthesis component, on the one hand, linked to the dialogue system evaluation, and on the other hand, in a traditional MOS way. We are thus able to analyze and discuss differences that arise due to the evaluation methodology. Our results suggest that MOS scales are not sufficient to assess speech synthesis quality, leading to implications for future research that are discussed in this paper. Furthermore, our results indicate that synthetic hesitations are able to increase task performance and that an elaborated hesitation strategy is necessary to avoid likability issues.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interactive Hesitation Synthesis and Its Evaluation

Conversational spoken dialogue systems that interact with the user rather than merely 1 reading text can be equipped with hesitations to manage the dialogue flow and the users’ attention. 2 Based on a series of empirical studies, we built an elaborated hesitation synthesis strategy for 3 dialogue systems that inserts hesitations of scalable extent wherever needed in the ongoing 4 utterance. So ...

متن کامل

Modelling Hesitation for Synthesis of Spontaneous Speech

The current work deals with the modelling of one type of disfluency, hesitations. A perceptual experiment using speech synthesis was designed to evaluate two duration features found to be correlates to hesitation, pause duration and final lengthening. A variation of F0 slope before the hesitation was also included. The most important finding is that it is the total duration increase that is the...

متن کامل

Slovak Speech Database for Experiments and Application Building in Unit-Selection Speech Synthesis

After the years of hesitation the conservative Slovak telecommunication market seems to become conscious of the need of voice driven services. In the last year, all the three telecommunication operators have adopted our text to speech system Kempelen in their interactive voice response systems. The diphone concatenative synthesis has probably reached the frontier of its abilities and so the nex...

متن کامل

Synthesis and Experimental-Modelling Evaluation of Nanoparticles Movements by Novel Surfactant on Water Injection: An Approach on Mechanical Formation Damage Control and Pore Size Distribution

Water injection is used as a widespread IOR/EOR method and promising formation damages (especially mechanical ones) is a crucial challenge in the near-wellbore of injection wells. The magnesium oxide (MgO) NanoParticles (NPs) considered in the article underwater flooding experiment tests to monitor the promising mechanical formation damage (size exclusion) in lab mechanistic scale include m...

متن کامل

On the functions of the vocalic hesitation euh in interactive man-machine question answering dialogs in French

This paper deals with the functions of the French vocalic hesitation euh in interactive speech of man-machine question answering dialogs. The present analysis suggests that the vocalic hesitation euh may carry various properties in speech, both disfluent signaling the speakers’ efforts to put the intended message under production into appropriate words, and fluent, as markers of discourse struc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018